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(57) Abstract 

Equine rhinovinis 1 (ERhVl) is a respiratory pathogen of horses which has an uncertain taxonomic status. The nucleotide sequence 
of the ERhVl genome and amino acid sequence have been substantially determined (figure 2). The predicted polyprotein was encoded by 
6.741 nucleotides and possessed a typical picomavirus proteolytic cleavage pattern, including a leader polypeptide. The genomic structure 
and predicted amino acid sequence of ERhVl were more similar to those of foot-and-mouth disease viruses (FMDV), the only members of 
the aphthovirus genus, than other picomaviruscs. Nucleotide sequences coding for the complete polyprotein. the polymerase, and VP1 were 
analyzed separately. The phylogenetic trees confirmed that ERhVl was more closely related to aphthoviruses than to other picoma viruses. 
Virion proteins and virus-like panicles are described and probes, primers, antigens, vectors, diagnostics and tests developed. 
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EQUINE RHINOVIRUS 1 PROTEINS 
rvrpontrcTiON to rNVENTlON 

This invention relates to the equine rhinovirus 1 (ERhVl) which has been 
sequenced and characterized. In particular, the invention relates to nucleotide and 
5 protein sequences of ERhVl and a range of clinical and diagnostic products derived 
from ERhVl. 

RArttrriROiiND of invention 

Equine rhinovirus 1 (ERhVl) was first isolated from horses in the United 
Kingdom and subsequendy from horses in mainland Europe, the USA and 

1 0 Australia. Most isolates were from the nasopharynx of horses with an acute, febrile 
respiratory disease. Virions had the characteristic size and morphology of 
picomaviruses and were acid-labile. Two other serologically distinct, acid-labile 
picomaviruses, ERhV2 and ERhV3, have also been isolated from horses. 

Considerable uncertainty has surrounded the classification of ERhVl. 

15 Physicochemical studies have shown that the nucleic acid density and base 
composition of ERhVl differ from those of rhinoviruses. In contrast to 
rhinoviruses, ERhVl has a broad host-cell range in vitro and in vivo and there is 
no evidence of extensive antigenic variation. Infection of horses with ERhVl 
causes a disease characterized by an acute febrile respiratory disease accompanied 

20 by anemia, fecal and urine shedding and viral persistence. The signs of systemic 
infection and persistence are not characteristic of rhinovirus infections in other 
species. The known host range of ERhVl is broad and includes rabbits, guinea 
pigs, monkeys and humans, although in these species the virus does not appear to 
spread horizontally. There is both experimental and epidemiological evidence of 

25 ERhVl infection of humans. A human volunteer inoculated intranasally with 
ERhVl developed severe pharyngitis, lymphadenitis, fever and viremia, and high 
ERhVl antibody titers were found in the sera of 3 of 12 stable workers whereas no 
ERhVl antibody was found in the sera of 159 non-stable workers. 

In order to clarify the taxonomic status of ERhVl, a detailed study was 

30 undertaken to determine the nucleotide and amino acid sequence of ERhVl. The 
resultant studies provided the complete nucleotide sequence of the gene encoding 
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the ERhVl poiyprotcin and the 3'-nontranslated region (NTR) as well as pan of the 
nucleotide sequence of the 5 'NTR. The amino acid sequence of the various ERhVl 
proteins was deduced from the nucleotide sequence. 

The analysis of the nucleotide sequence of ERhVl confirmed previous 
5 studies which indicated that many properties of ERhVl are not consistent with 
those of other members of the genus Rhinovirus. Indeed many of the 
physicochemical and biological properties of ERhVl have suggested ERhVl is 
more closely related to foot-and-mouth disease virus (FMDV) the sole member of 
the Alpthovirus genus. In addition to the overall sequence similarity, several 

10 features of the ERhVl genome are similar to those of FMDV. The ERhVl L 
protein is most similar to its counterpart in aphthoviruses in both length, 207 amino 
acids in ERhVl and 201 in FMDV, and in amino acid sequence identity. In 
aphthoviruses, the L protein catalyses its own cleavage from the potyprotein, and 
mediates cleavage of the p220 component of the cap-binding complex leading to 

15 inhibition of translation of capped rnRNAs. Cardiovirus L proteins are only 67-76 
amino acids long and are not auto catalytic. In contrast to the cardioviruses, 
aphthoviruses utilize two distinct initiation codons, which results in different forms 
of the L protein, Lab and Lb, differing from each other by 28 amino acids at their 
N-termini. 

20 The second initiation codon occurs in a more favourable context, which is 

presumably the reason why Lb, the smaller of the two proteins, is the predominant 
species. Thus far, differences in the function of the two FMDV L proteins have not 
been detected. ERhVl also possesses a second ATG, 63 bases downstream from 
the first optimal ATG, which is also present in a context optimal for initiation of 

25 translation. Translation from this ATG would result in an L protein with 21 fewer 
amino acids at its N-terrninus. Therefore, it is probable that ERhVl possesses a 
second species of L protein, similar to the FMDV Lb protein. If so, the reason for 
the existence and conservation of two forms of the L protein in ERhVl and FMDV 
is an intriguing question. Curiously, ERhVl has tand mly repeated ATG codons 

30 at each of the possible initiation sites, where the first ATG in each case does not 
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occur in a context optimal for translation. The role of these ATGs may be to 
ensure that translation is initiated from both possible initiation sites. 

The 2A protease is only 16 amino acids in length in both FMDV and 
ERhVl, compared to 142-149 amino acids in other picornaviruses. In FMDV 2A 
protease cleaves at its C-terminus but, unlike the 2A protease of other 
picornaviruses, appears not to have a role in shut down of host cell macromolecular 
synthesis. The high degree of conservation of the FMDV and ERhVl 2A proteins 
is intriguing and suggests an important role for this protein in the diseases produced 
by these viruses. 

It may be expected that the tree derived from the complete polyprot in 
coding sequence would provide the most representative view of the taxonomic 
status of ERhVl by reducing any bias imparted by using restricted parts of the 
genome with highly variable evolutionary rates. However, such analysis is 
restricted because there are only a few complete polyprotein sequences available. 
The polymerase genes are the most conserved genes in positive strand RNA viruses 
and they have been used to construct a taxonomy, and to predict the ancient roots, 
of these viruses. In contrast to the polymerase gene, the VP1 gene encodes the 
major antigenic determinants of the virus and evolves more rapidly than other 
regions in the genome. The diversity of VP1 regions make them useful for the 
study of closely related picornaviruses. Thus, trees based on the polymerase and 
VP1 genes presumably reflect the extremes of evolutionary rates from which the 
taxonomic status and evolutionary origin of ERhVl could be identified. The 
ERhVl VP1 amino acid sequence was more similar to FMDV than to any other 
sequence in the data base; this was true even when representative segments across 
the entire sequence were separately analysed. 

Therefore, we consider that the difference in the topology of the VP I, 
compared to the other two trees, is most unlikely to be a consequence of gen tic 
recombination. The topographic differences between the three ERhVl trees 
compared to those of aphthoviruses, particularly the VP1 derived trees, as well as 
the presence of only one VPg gene in ERhVl genome, leads us to conclude that 
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ERhVI is probably a member of a distinct genus proposed to be called 
Equirhinovirus \ 

The reassessment of the taxonomic status of ERhVI focuses on a 
requirement to reassess the biology of the vims particularly with respect to the 
nature of clinical disease as well as means for control by vaccination and improved 
methods of diagnosis. For example, cardioviruses and aphthoviruses cause viremic 
infections accompanied by myocarditis. Clinical disease caused by ERhVI is 
generally considered to be confined to the respiratory tract even though there is a 
viremia and the virus is shed in faeces and urine. Whether ERhVI infection 
produces systemic disease similar to that observed in aphthovirus or cardiovirus 
infections, including the production of myocarditis, needs to be investigated. There 
is serological evidence that the incidence of ERhVI infection is as high as 50% in 
some horse populations however, the number of reported isolations of ERhVI is 
very small. We have clear evidence that primary isolation of the virus from clinical 
specimens is known to be difficult, suggesting that the true incidence of ERhVI 
disease is much greater than reported. 

The determination of the complete nucleotide sequence of ERhVI 
polyproiein has important practical applications in developing novel methods for the 
diagnosis and control of ERhV disease in horses and other species. 
OBJECT AND STATEMENT OF INVENTION 



In one aspect, the invention provides a substantially pure nucleotide sequence 
for ERhVI being: 



a substantially pure nucleotide sequence for ERhVI being: 




CCGTCAAGCC 


CGTTGCCTGT 


ATAGCCAGGT 


AACCGGACAG 


CGGCTTG CTG 


GATTTTCCCG 


-375 


GTGCCATTGC 


TCTGGATGGT 


GTCACCAAGC 


TGACAAATGC 


GGAGTGAACC 


TCACAAAGCG 


-315 


ACACGCCTGT 


GGTAGCGCTG 


CCCAAAAGGG 


AGCGGAACTC 


CCCGCCGAGG 


CGGTCCTCTC 


-255 


TGGCCAAAAG 


CCCAG CGTTG 


ATAGCGCCTT 


TTGGGATGCA 


GGAACCCCAC 


CTGCCAGGTG 


-195 


TGAAGTGGAG 


TGAGCGGATC 


TCCAATTTGG 


TCTGTTCTGA 


ACTACAC CAT 


TTACTGCTGT 


-135 


GAAGAATGCC 


CTGGAGGCAA 


G CTGGTT AC A 


GCCCTGACCA 


GGCCCTGCCC 


GTGACTCTCG 


-75 


ACCGGCGCAG 


GGTCAAAAAT 


TGTCTAAGCA 


GCAGCAGGAA 


CGCGGGAGCG 


TTTCTTTTCC 


-15 


TTTTGTACTG ACATGATGGC 


GGCGTCTAAG 


GTGTATAGAG 


TTTGCGAGCA 


GACTCTGCTG 


45 


GCAGGTGCCG 


TTCGCATGAT 


GGACAAATTC 


TTGCAAAAGA 


GAACTGTTTT 


TGTCCCCCAT 


105 


CTTGACAAAA 


CAATTCGTrr 


GACTGGACTC 


CACAATTATG 


ACAATACTTG 


CTGGTTGAAT 


165 


GCCTTGACAC 


AACTGACACA 


GATTCTTGGA 


ATTCGGCTTT 


TTGATGAACA 


CTTCGGCAAT 


225 


AGAGGTCTGT 


TCACTCGGAA 


AACAATTGAT 


TGGGTGAGTG 


ACCAGACTGG 


TATAAAAGAT 


285 
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CTAAAATCAG GAGCACCGCC ACTCGTGGTG GTGTACAAAC TGTGGCAACA TGCACACTTC 345 
GATGTCGGTA CGATGGAGAA ACCCCGGTCG ATTACTCTAT GGTCTGGCCC CAAAGTGTGT 4 OS 

CTTTCTGATT TCTGGG CCTG TGTTTCGG CA AAACCGGGAC ATGCAGTATT CTACCTTCTC 46 5 

ACAAGCGAGG GTTGGATCTG TGTTGATGAC AAGAAAATAT ACCCAGAAAC ACCCAAAACA 525 

5 GAGGATGTAC TTGTTTTTGC GCCCTATGAC TTTGAGTCAC TGGGCAAGGA CCCACCAAAG 585 

CTACACCAGA GATATGAAAA AGCATTTGAG CTCAGTGGCG GAGGTACATC CACTCCAACA 64 5 

ACTGGCAACC AAAACATGTC CGGAAACAGT GGTTCAATTG TTCAAAATTT TTACATGCAA 705 

CAGTACCAGA ATTCAATTGA CG CAGACCTG GGAGACAATG TGATTAGCCC TGAAGGCCAG 765 

GGCAGCAACA CTAGTAGTTC AACCTCATCA AGCCAATCCT CTGGCTTGGG CGGGTGGTTC 825 

10 TCTAGTTTGC TGAACCTTGG AACAAAACTA CTGGCTGACA AGAAGACAGA AGAGACTACA 88 5 

AACATTGAAG ACAGAATTGA AACAACAGTG GTTGGAGTCA CTATTATTAA TTCACAAGGA 945 

TCTGTTGGAA CAACCTACTG TTACTCCAAA CCGGATGGTA GACCACCATC CACAGTGTCA 1005 

GACCCAGTTA CCAGACTTGG ACCCACGCTT TCCAGGCACT ACACATTTAA GGTAGGTGAG 106 5 

TGGCCCCATT CTCAATCACA TGGTCACGCA TGGATCTGTC CGTTGCCAGG TGACAAACTC 1125 

AAGAAGATGG GCAGTTTTCA TGAGGTTGTC AAAGCCCACC AC CTG GTCAA GAACGGCTGG 118 5 

GATGTGGTTG TGCAGGTGAA TCCCTCATTT GCTCACTCCG GGCCGCTGTG TGTAGCAGCA 124 5 

GTGCCGGAGT ACGAACACAC ACATGAGAAA GCACTCAAGT GGTCTGAGCT TGAGGAACCA 13 0 5 

GCTTACACAT ACCAACAACT TTCAGTTTTT CCCCACCAGT TGCTAAATTT GAGGACAAAT 1365 

TCATCAGTGC ATTTGGTGAT GCCCTACATT GGGCCAGGCC AACCAACAAA TCTGACTTTG 1425 

CACAACCCGT GGACCATTGT TATTTTAATT TTGTCTGAAT TGACAGGACC TGGCCAAACT 1485 

GTGCCTGTGA CCATGTCGGT GGCTCCCATC GATGCAATGG TTAATGGGCC TCTTCCAAAT 154 5 

CCAGAGGCAC CGATTAGAGT GGTGTCTGTG CCTGAATCAG A TTCTTTTA T GTCTTCAGTA 1605 

CCTGATAATT CGACTCCACT ATACCCCAAG GTTGTGGTCC CACCGCGCCA AGTTCCTGGC 166 S 

CGGTTTACAA ATTTCATTGA TGTGGCAAAA CAGACATATT CATTTTGTTC CATTTCTGGA 1725 

AAACCTTATT TTGAGGTTAC CAACACCTCT GGGGACGAGC CACTGTTTCA GATGGATGTG 1785 

TCGCTCAGTG CGGCAGAGCT ACATGGCACT TACGTAGCTA GTTTGTCATC ATTTTTTGCA 1845 

CAGTACAGAG GCTCACTTAA TTTCAACTTT ATTTTCACTG GTGCAG CAG C CACTAAGGCA 1905 

AAGTTTCTGG TTGCTTTTGT GCCTCCCCAC AGTGCAGCGC CCAAAACGCG CGATGAAGCA 1965 

ATGGCGTGCA TCCATGCCGT GTGGGATGTT GGCTTGAACT CAGCTTTTTC TTTTAATGTA 2 02S 

CCTTATCCCT CCCCTGCTGA CTTCATGGCC GTTTATTCTG CGGAACGGAC GGTTGTGAAT 2085 

GTCTCTGGAT GGCTTCAAGT TTATGCACTA ACAGCTCTAA CTTCAACTGA CATTGCCGTG 2145 

AACAGTAAAG GCCGTGTGCT G G T T GCT GT T TCCGCCGGCC CAGACTTCTC CCTTCGTCAC 22 05 

CCGGCGGACC TGCCCGACAA GCAGGTTACC AATGTGGGAG AGGATGGTGA ACCCGGTGAG 226S 

ACAGAGCCTC GTCATGCTTT GTCACCCGTG GACATGCACG TGCACACAGA TGTCAGTTTC 232 5 

TTGCTTGACC GGTTCTTTGA TGTTGAGACA CTTGAGCTTT CAAATTTGAC AGGTTCTCCT 2 385 

GCCACACATG TTCTGGATCC GTTTGGCTCG ACTGCCCAAC TGGCTTGGGC ACGTCTGCTA 2 44 5 

AACACTTGCA CCTACTTCTT TTCTGATTTG GAATTGTCAA TCCAGTTTAA ATTTAC CACC 2 SOS 

ACTCCGTCCT CTGTTGGAGA GGGCTTTGTG TGGGTGAAGT GGCTCCCTGT TGGAGCACCA 2 565 

ACCAAGACCA CAGATGCTTG GCAGTTAGAA GGAGGTGGAA ATT CAG TT AG AATTCAAAAA 2625 

TTGGCCGTTG CAGGGATGTG CCCCACTGTT GTGTTCAAGA TTGCAGGCTC CCGTTCACAA 2685 

GCCTGTGCTT CAGCGTTGCC ATATACATCA ATGTGGCGTG TTGTGCCAGT CTTTTACAAT 2745 

GGCTGCGGTG CACCTACCAA AGAAAAGGCA ACCTACAATT GGCTTCCTGG TGCACACTTT 2805 

GGTTCCATCT TGCTGACTTC TGATGCGCAT GATAAAGGAG GGTGCTACTT GCGGTATGCT 2865 

TTCCGCGCGC CAGCGATGTA TTGCCCTCGA CCCATTCCGC CGGCTTTTAC GCGTCCAGCG 2925 

GACAAAACCA GACATAAATT TCCCACTAAC ATCAACAAAC AGTGTACTAA TTACTCTCTC 2 98S 

CTCAAATTGG CTGGAGATGT TGAGAGCAAC CCTGGCCCCA CTATTTTTTC CAAAGCATCA 3 045 

GCAGACCTGA ATGCCTTGTC AACGTCGCTA GGTGAATTGA CTGGCATGCT AAAAGATCTT 3105 
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AAAGCCAAGG CAGAAACTTA TTCCCCGTTT TACAAAATGG CCAAAATGCT TTTCAAACTT 316 5 
GCAACACTAG CTGTGGCAGC TATGAGGACA AAGGAC C CAG TAGTGGTGGT TATGTTGATT 3 22 5 
GCTGATTTCG GATTGGAGGT CTTTGAC AC T GG GTTTTT CT TTTCCTACTT TCAAGAGAAG 3 28 5 
TTGCAGCCTT ATATGAAAAC TATTCCTGGT AAGATTTCTG ATTTGGTCAC TGATGCGGCT 3 34 5 

5 ACGGCTGCCG CCCAAATTCC AAAGGGAGTG TATTCTTTTG TGTCGTCATT TTTCGAAACG 34 05 

CCTGAAGGAG TGGTTGAGAA GCAGGTGTCT CTTCGGACAG TGAATGACAT ATTTGCTTTG 3465 

CTTAAAAATT CTGATTGGTT CATAAAGACT CTTGTTGCCC TCAAGAAATG GCTGACATCC 3 525 

TGGTTTGCTC AAGAACAACA GGCAGATGAT GCGCTCTATT CAGAATTGGA AAAATATCCC 3585 

TTGTACAAGT TAAAATTGAA GGAACCTGAT ACTCAAGAGG AAGCGCGCCA GTGGTTT AAA 364 5 

0 GACATGCAGC AGCGTGCTCT CGCTGTGAAG GACAAAGGTC T CTTTTC CCT CCTGCAAATT 3 705 

CCATTAGTTA ACTTGCCCCA GAGCCGTCCA GAGCCCGTTG TATGCGTCCT TCGGGGCGCA 3 765 

TCAGGGCAAG GCAAATCTTA TTTGGCAAAT CTGATGGCTC AAGCAATTTC GCTTCTCTTG 3825 

GTTGGCAAGC AGGACAGTGT GTGGAGTTGT CCTCCTGACC CCACATATTT TGATGGCTAT 3 88 5 

AACGGACAGG CTGTGGTGAT TATGGATG CA TTGGGCCAGG ATCCGAATGG TGCTGACTTT 3945 

5 AAATATTTTT GCCAGATGGT CTCTACAACA G CT T T T GTA C CACCTATGGC CCATTTGGAT 4 00S 

GATAAAGGCA TTCCATTTAC TTCTCCTGTT GTTATTTGTA CTACAAATTT GCATTCATCT 406 5 

TTTACCCCTA TTACTGTTTC TTGTCCTGAA GCTCTTAAGA GGAGGTTTCG GTTTGATGTG 4X25 

ACGGTGTCCG CTAAACCGGG CTTTGTGCGC ACTGTTGGTT CAAACCAGCT TTTGAATCTC 418 5 

CCACTTGCTC TTAAGCCAGC TGGTCTTCCC CCACACCCTA TCTTTGAAAA TGACATGCCC 4 245 
0 ATTATAAATG GGCAGGCTGT TAAATTGGCT CTTTCTGGTG GAGAAGTGAC AGCTTTTGAG 
CTTATTGAGA TGATACTGTC AGAAGTTCAA AACAGACAAG ACACACACAA AATGCCCATT 

TTTAAACAAT CATGGTCTGA TTTGTTCAGA AAGTGTACAA CTGATGAGGA ACAGAAAATG 4425 

TTGCAGTTTT TAATTGACAA TAAAGATTCA GAAATTCTCA GGGC GTTTGT TTCAGAACGC 4465 

TCCATTTTAC TACATGAAGA GTATCTTAAA TGGGAGTCAT ATATGACCAG GAGAGCCAAG 4 54 5 

TTTCACCGCC TGGCTCCTGA TTTTGCTATG TTTCTATCCA TTCTTACTTC ACTGATTGTT 4605 

ATTTTTTGTT TAGTTTATTC TATGTATCAA CTTTTTAAGA CCCCTGACGA GCAATCAGCT 4665 

TATGATCCTT CAACTAAGCC AAAACCAAAG ACCCAGGAAG TGAAAACACT GAAGATTAGG 4 72 5 

ACTGAGACTG GTGTACCAGC AACTGACTTG CAACAATCCA TCATGAAAAA TGTTCAGCCA 4 78 5 

ATTGAGCTTT ACCTTGACAA TGAATTGGTT ACTGACTGCT CTGCCTTGGG TGTTTATGAC 4845 

AATTCATATT TGGTGCCCCT TCATTTGTTT GAATTTGATT TTGATACCAT TGTGCTTGGT 4 905 

GGACGTCATT ACAAGAAAGC TGAGTGTGAG AAGGTAGAGT TTGAGCTTGA AGTGAATGGA 4 965 

GACGTGGTGT CATCAGATGC GTGTCTACTT CGAGTGTCAT CGGGGCCTAA AGTTAGAAAT 502 5 

ATTGTTCATC TTTTTACAAA TGAAATTGAA TTGAAGAAAA TGACCCAAGT GACAGGAATC 508 5 

ATGAATTCAC CACACCAGGC ACGCACTGTG TTTTTTGGCA GTTTTTTGAC AGTGAGGAAG 5145 

TCCATCTTAA CATCGGATGG GACTGTAATG CCCAATGTTT TGTCCTATGC CGCTCAGACC 52 OS 

TCGCGTGGGT ATTGTGGCGC TGCAATTGTT GCTGGCTCAC CTGCCCGCAT AATTGGTATC 5265 

CATTCAGCTG GCACTGGATC TGTTGCATTT TGCTCCCTGG TGTCCAGAGA CGCGCTGGAG 5325 

CAACTCTGGC CCCAGAAACA GGGCAACGTT AGTCGCCTTG ATGACGATGT GAGGGTGTCT 538 5 

GTTCCGCGCC GCTCCAAATT GGTGAAATCA TTGGCTTACC CCATTTTCAA ACCTGACTAT 544 5 

GGCCCAGCGC CACTCTCTCA ATTTGACAAG CGCCTGTCAG ACGGCGTGAA GCTGGATGAA S505 

GTGGTTTTTG CTAAACATAC TGGAGACAAG GAGATTTCCG CACAGGACCA GAAATGGCTC S565 

TTGCGTGCGG CGCATGTATA CGCCCAGAAG GTTTT CTCCC GGATTGGATT TGACAACCAG 5625 

GCTTTGACTG AAAAAGAGGC CATTTGTGGC ATTCCTGGCC TTGACAAGAT GGAGCAGGAC 5685 

ACCGCTCCCG GGCTGCCCTA TGCTCAGCAA AATAAGAGAA GGAAAGACAT CTGTGATTTT 574 5 

GAAGAGGGCC GGCTGAAGGG CGCCGAACTC CAAAAGGACA GATTTATGGC TGGTGACTAC S805 

TCTAATTTGG TCTATCAATC ATTTTTGAAA GATGAGATCC GCCCACTTGA GAAAGTTAGG 5865 

GCTGGAAAGA CCCGCCTGAT TGACGTGCCG CCGATGCCCC ATGTGGTGGT TGGTAGGCAG S92 5 
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CTCTTGGGCC GGTTTGTGGC AAAATTTCAT GAAGCAAATG GATTTGACAT TGGCTCAGCC 598 5 

ATTGGATGTG AC C CAGATGT GGACTGGACT CGGTTTGGCC TCGAGTTGGA GCGTTTCAGG 6045 

TATGTATATG CCTGTGACTA CTCACGGTTC GATGCCAACC ATGCAGCTGA TGCAATGAGA 6105 

GTTGTGCTTA ACTACTTTTT CTCTGAGGAC CACGGTTTCG ACCCTGGTGT GCCTGCTTTT 6165 

ATTGAGTCAC TGGTTGATTC AGTGCATGCC TATGAAGAGA AAAGGTATAA CATCTACGGT 622 S 

GGCTTGCCAT CCGGGTGTTC CTGCACATCA ATTTTGAATA CCATCTTGAA CAATGTTTAC 6 285 

ATTCTTGCAG CTATGATGAA GGCTTATGAG AATTTTGAGC CAGATGACAT TCAGGTCATT 6345 

TGCTATGGGG ACGACTGCCT CATTGCTTCT GATTTTGAAA TTGATTTCCA ACAACTGGTG 64 05 

eCTGTCTTTT CTAGTTTTGG ACAGGTAATA ACTACAGCTG ACAAGACTGA TTTTTTTAAA 64 65 

CTGACAACGC TTTCGGAGGT GACCTTCCTT AAGCGCGCTT TTGTTCTGAC GGC CTTTTA C 6S25 

AAG CCAGTGA TGGATGTGAA GACCCTTGAA GCAATCTTAA GCTTTGTTCG CCCAGGCACA 6 585 

CAGGCTGAAA AGCTCCTGTC CGTGGCGCAG TTGGCAGGCC ACTGCGAACC GGAGCAGTAT 6645 

GAGCGCCTGT TTGAGCCCTT TGCTGGGATG TATTTCGTCC CTACTTGGCG ACTTGCGCCT 6705 

GCAGTGGTTG ATGAAGCTTG GATGCTAAAT T CTTTTTG AC TTTGTTTTTC TTTGTTTTCT 6765 

TTTAGGCTTT TAAGGTGTTA AGTTTAAAGG TTAAGAGTTT TTAGAAGTTA AGATAGAGTT 6825 
TAGTTTTTAG TTTTGAGC-poly (A) 

as disclosed in Fig. 2 and functional equivalents of said nucleotide sequence 
including naturally occurring derivatives, variants, degeneracy equivalents and 
deletion mutants thereof. 

In another aspect, the invention provides a substantially pure amino acid 
sequence being: 

a substantially pure amino acid sequence being: 

MAASKVYRVCEQTLLAGAVRMMDKF 
LQKRTVFVPHLDKTIRLTGLHNYDN 
TCWLNALTQLTQILGIRLFDEHFGN 
RGLFTRKTIDWVSDQTGIJCDLKSGA 
PPLVVVYKLWQHGHLDVGTMEKPRS 
ITLWSGPKVCLSDFWACVSAKPGHA 
VFYLLTSEGWICVDDKKIYPETPKT 
EDVLVFAPYDFESLGKDPPKLHQRY 
L * VP4 

EKAFELSGGGTSTPTTGNQNMSGNS 
GS iVQNFYMQQYQNSrDADLGDNVI 
SPEGQGSNTSSSTSSSQSSGLGGWF 

VP 4 + VP2 

SSLLNLGTKLLADKKTEETTNXEDR 
IETTVVGVT1 INSQGSVGTTYCYSK 
PDGRPPSTVSDPVTRI-GPTLSRHYT 
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VP 2 + VP 3 

SVAPIDAMVNGPLPNPEAPIRVVS 
PESDSFMSSVPDNSTPLYPKVVVP 
RQVPGRFTNFIDVAKQTYSFCS IS 
KPYFEVTNTSGDEPLFQMOVSLSA 
ELHGTYVASLSS FFAQYRGSLNFN 
I FTGAAATKAKFLVAFVPPHSAAP 
TRDEAMACIHAVWDVGLNSAFS FN 
PYPSPADFMAVYSAERTVVNVSGW 
QVYALTALTSTDIAVNSKGRVLVAV 

VP 3 4 VP1 

SAGPDFSLRHPADLPDKQVTNVGED 
GEPGETEPRKALSPVDMHVHTDVS F 
LLDRFFDVETLELSNLTGSPATHVL 
DPFGSTAQLAWARLLNTCTYFFSDL 
BLS IQFKFTTTPS SVGEGFVWVKWL 
PVGAPTKTTDAWQLEGGGNSVRI QK 
I-AVAGMCPTVVFKIAGSRSQACASA 
LPYTSMWRVVPVFYNGWGAPTKEKA 
TYNWLPGAHFGS ILLTSDAHDKGGC 
YLRYAFRAPAMYCPRPIPPAFTRPA 

VP1 * 2A 

DKTR HKFPTN INKQCTNYSLLKIjAG 
2A * 2B 

dvesnpgptifskasadlnalstsl 
geltgmlkolkakaetyspfykmak 
mlfklatlavaamrtkdpvvvvmli 
adfglevfdtgfffsyfqeklqpym 
k t i pgkisdlvtdaataaaqipkgv 
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2B * 2C 

Y S FVSSFFETPEGVVEKQVSLRTVN 
DIFALLKNSDWFIKTLVALKKWLTS 
WFAQEQQADDALYSELEKYPLVKLK 
5 LKEPDTQEEARQWFKDMQQRALAVK 
DKGLFSLLQI PLVNLPQS RPEPVVC 

VLRGASGQGK1SYLANLMAQAXSL LL 

VGKQDSVWSCPPDPTYFDGYNGQAV 

VXMDALGQDPNGADFKYFCQMVSTT 

AFVPPMAHLDDKGXPFTSPVVICTT 

NLHSSFTPITVSCPEALKRRFRFDV 

TVSAKPGFVRTVGSNQLLNLPLALK 

PAGLPPHP IFENDMPIINGQAVKLA 

LSGGEVTAFELI EMILSEVQNRQDT 

2C + 3 A 

HKMPIFKQSWSDLFRKCTTDEEQKM 

LOFLIDNKDSEILRAFVSERSXLLH 

EEYLKWESYMTRRAKFHRLAADFAM 

PLSILTSLIVIFCLVYSMYQLFKTP 
3A 4 3B 

DEQSAYDPSTKPKPKTQEVKTLKIR 
3B + 3C 

TETGVPATDLQQSIMKNVQPXELYL 

DNELVTDCSALGVYDN5YLVPLHLF 

EFDFDTXVLGGRHYKKAECEKVEFE 

LEVNGDVVSSDACLLRVSSGPKVRN 

IVHLFTNEIELKKMTQVTGIMNSPH 

QARTVFFGSFLTVRKSILTSDGTVM 

PNVLSYAAQTSRGYCGAAIVAGSPA 

*IIGIHSAGTGSVAFCSLVSRDALE 
3C * 3D 

QLWPQKQGKVSRLDDDVRVSVPRRS 
KLVKSLAYPIFKPDYGPAPLSQFDK 
RLSDGVKLDEVVFAKHTGDKE I SAQ 
DQKWLLRAAHVYAQKVFSRIGFDNQ 
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as disclosed in Fig. 2. 

In another aspect, the invention provides proteins derived from ERhVl which 
exhibit virus like particle characteristics incorporating VP I and having the 
following amino acid sequence: 

a protein or vims like particle incorporating VP I, derived from ERhVl and 
having the following amino acid sequence: 
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In another aspect, the invention provides proteins derived from ERhVl which 
exhibit virus like panicle characteristics incorporating VP2 and having the 
following amino acid sequence: 

a protein or virus like panicle incorporating VP2, derived from ERhVl and 
having the following amino acid sequence: 
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In another aspect, the invention provides proteins derived from ERhVl which 
exhibit virus like particle characteristics incorporating VP3 and having the 
following amino acid sequence: 

a protein or virus like particle incorporating VP3, derived from ERhVl and 
having the following amino acid sequence: 

APIRVVSVPBSDSFMSS VPDNSTPL 

YPKVVVPPRQVPGRFTNFIDVAKQT 

YSFCSISGKPYFEVTNTSGDEPLFQ 

MDVSLSAAELHGTYVASLSSFFAQY 

RGSLNFNFIFTGAAATKAKFLVAFV 

PPHSAAPKTRDEAMACIHAVWDVGL 

NSAFSFNVPYPSPADFMAVYSAERT 

VVNVSGWLQVYALTALTSTDIAVKS 

KGRVLVAVSAGPDFSLRHPADLPDK 
Q 
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In another aspect, the invention provides proteins derived from ERh V I which 
exhibit virus like panicle characteristics incorporating VP4 and having the 
following amino acid sequence: 

a protein or virus like panicle incorporating VP4, derived from ERh VI and 
having the following amino acid sequence: 

GGGTSTPTTGNQNMSGNSGS XVQNF 

YMQQYQNSIDADLGDNVrSPEGQGS 

NTSSSTSSSQSSGLGGWFSSLLNLG 
T K L L A 

The invention also provides a virus like panicle comprising any one or a 
combination of VP I, VP 2, VP3 and VP 4. 

In another aspect, the invention provides a substantially pure nucleotide 
sequence for VP1 being: 



GTTACCAATG 


TGGGAGAGGA 


TGGTGAACCC 


GGTGAGACAG 


AGCCTCGTCA 




CCCGTGGACA 


TGCACGTGCA 


CACAGATGTC 


AGTTTCTTGC 


TTGACCGGTT 


CTTTGATGTT 


GAGACACTTG 


AGCTTTCAAA 


TTTGACAGGT 


TCTCCTGCCA 


CACATGTTCT 


GGATCCGTTT 


GGCTCGACTG 


CCCAACTGGC 


TTGGGCACGT 


CTGCTAAACA 


CTTGCACCTA 


CTTCrTTTCT 


GATTTGGAAT 


TGTCAATCCA 


GTTTAAATTT 


ACCACCACTC 


CGTC CTCTGT 


TGGAGAGGGC 


TTTGTGTGGG 


TGAAGTGGCT 


CCCTGTTGGA 


GCACCAACCA 


AGACCACAGA 


TGCTTGGCAG 


TTAGAAGGAG 


GTGGAAATTC 


AGTTAGAATT 


CAAAAATTGG 


CCGTTGCAGG 


GATGTG CCCC 


ACTGTTGTGT 


TCAAGATTGC 


AGGCTCCCGT 


TCACAAGCCT 


GTGCTTCAGC 


GTTGCCATAT 


ACATCAATGT 


GGCGTGTTGT 


GCCACTCTTT 


TACAATGGCT 


GGGGTGCACC 


TACCAAAGAA 


AAGGCAACCT 


ACAATTGGCT 


TCCTGGTGCA 


CACTTTGGTT 


CCATCTTGCT 


GACTTCTGAT 


GCGCATGATA 


AAGGAGGGTG 


CTACTTGCGG 


TATGCTTTCC 


GCGCGCCAGC 


GATGTATTGC 


CCTCGACCCA 


TTCCGCCGGC 


TTTTACGCGT 


CCAGCGGACA 


AAACCAGACA 


TAAATTTCCC 


ACTAACATCA 


ACAAACAGTG 


TACT 









and functional equivalents of said nucleotide sequence including naturally occurring 
derivatives, variants and degeneracy equivalents. 
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In another aspect, 


the invention provides a substantially pure nucleotide 


sequence for VP2 being: 










GACAAGAAGA 


CAGAAGAGAC 


TACAAACATT 


GAAGACAGAA 


TTGAAACAAC 


AGTGGTTGGA 


GTCACTATTA 


TTAATTCACA 


AGGATCTGTT 


GGAACAACCT 


ACTGTTACTC 


CAAACCGGAT 


GGTAGACCAC 


CATCCACAGT 


GTCAGACCCA 


GTTACCAGAC 


TTGGACCCAC 


GCTTTCCAGG 


CACTACACAT 


TTAAGGTAGG 


TGAGTGGCCC 


CATTCTCAAT 


CACATGGTCA 


CGCATGGATC 


TGTCCGTTGC 


CAGGTGACAA 


ACTCAAGAAG 


ATGGG CAGTT 


TTCATGAGGT 


TGTCAAAGCC 


CACCACCTGG 


TCAAGAACGG 


CTGGGATGTG 


GTTGTGCAGG 


TGAATCCCTC 


ATTTGCTCAC 


TCCGGGCCGC 


TGTGTGTAGC 


AGCAGTGCCG 


GAGTACGAAC 


ACACACATGA 


GAAAGCACTC 


AAGTGGTCTG 


AGCTTGAGGA 


ACCAGCTTAC 


ACATACCAAC 


AACTTTCAGT 


TTTTCCCCAC 


CAGTTGCTAA 


ATTTGAGGAC 


AAATTCATCA 


GTGCATTTGG 


TGATGCCCTA 


CATTGGGCCA 


GGCCAACCAA 


CAAATCTGAC 


TTTGCACAAC 


CCGTGGACCA 


TTGTTATTTT 


AATTTTGTCT 


GAATTGACAG 


GACCTGGCCA 


AACTGTGCCT 


GTGACCATGT 


CGGTGGCTCC 


CATCGATGCA 


ATGGTTAATG 


GGCCTCTTCC 


AAATCCAGAG 









and functional equivalents of said nucleotide sequence including naturally occurring 
derivatives, variants and degeneracy equivalents. 

In another aspect, the invention provides a substantially pure nucleotide 
sequence for VP3 being: 

GAGTGGTGTC TGTGCCTGAA TCAGATTCTT 



GCACCGATTA 
AATTCGACTC 



CACTATACCC CAAGGTTGTG GTCCCACCGC 



TTATGTCrTC AGTACCTGAT 
GCCAAGTTCC TGGCCGGTTT 



ACAAATTTCA TTGATGTGGC AAAACAGACA TATTCATTTT GTTCCATTTC TGGAAAACCT 

TATTTTGAGG TTACCAACAC CTCTGGGGAC GAGCCACTGT TTCAGATGGA TGTGTCGCTC 

AGTGCGGCAG AGCTACATGG CACTTACGTA GCTAGTTTGT CATCATTTTT TGCACAGTAC 

AGAGGCTCAC TTAATTTCAA CTTTATTTTC ACTGGTGCAG CAGCCACTAA GGCAAAGTTT 



CTGGTTGCTT TTGTGCCTCC CCACAGTGCA GCGCCCAAAA CGCGCGATGA AGCAATGGCG 
TGCATCCATG CCGTGTGGGA TGTTGGCTTG AACTCAGCTT 



TTTCTTTTAA TGT AC CTT AT 
GGACGGTTGT GAATGTC TCT 



CCCTCCCCTG CTGACTTCAT GGCCGTTTAT TCTGCGGAAC 
GG AT GG CTT C AAGTTTATGC ACTAACAGCT CTAACTTCAA CTGACATTGC CGTGAACAGT 
AAAGGCCGTG TGCTG GTTGC TGTTTCCGCC GGCCCAG ACT TCTCCCTTCG TCACCCGGCG 
GACCTGCCCG ACAAGCAG 

and functional equivalents of said nucleotide sequence including naturally occurring 
derivatives, variants and degeneracy equivalents. 
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In another aspect, the invention provides a substantially pure nucleotide 
sequence for VP4 being: 

GGCGGAGGTA CATCCACTCC AACAACTGGC AACCAAAACA TGTCCGGAAA CAGTGGTTCA 

ATTGTTCAAA ATTTTTACAT GCAACAGTAC CAGAATTCAA TTGACGCAGA CCTGGGAGAC 

AATGTGATTA GCCCTGAAGG CCAGGGCAGC AACACTAGTA GTTCAAC CTC ATCAAGCCAA 

TCCTCTGGCT TGGGCGGGTG GTTCTCTAGT TTGCTGAACC TTGGAACAAA ACTACTGGCT 

and functional equivalents of said nucleotide sequence including naturally occurring 
derivatives, variants and degeneracy equivalents. 

In another aspect, the invention provides oligonucleotide primers derived 
from the nucleotide sequence of Fig. 2 being highly specific for ERhVl or cross 
reactive with other ERhV types. 

The oligonucleotide primers may have any one of the following nucleotide 
sequences: 

VP IF 5' GTTGTGTTCAAGATTGCAGGC 3' 

VP1RI 5' TTGCTCTCAACATCTCCAGC 3' 
VP1R2 5' TAGCACCCTCCTTTATCATGCG 3' 

In another aspect, the invention provides an oligonucleotide probe derived 
from the sequence of Fig. 2. 

In another aspect, the invention provides diagnostic reagents, methods and 
kits characterised by the aforesaid oligonucleotide primers and probes. 

In another aspect, the invention provides antigens comprising any one or a 
combination of the non-capsid proteins, being other than the individual VP1 to VP4 
proteins, that are cleavage products of the polypeptide of Figure 2. 

In another aspect, the invention provides vaccines and vectors incorporating 
any one or a combination of virion proteins VP1 to VP4. 

In another aspect, the invention provides diagnostic tests for the detection of 
antibodies to ERhVi in blood of horses or other animals characterised by the use 
of the aforesaid antigens. Such diagnostic tests may be ELISA based. 

Tn a particularly preferred embodiment, the invention provides a test to 
distinguish horses infected with ERhVl in which said virus had replicated from 
horses which have been vaccinated with the vaccine incorporating any one or a 
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combination of virion proteins VP I to VP4; comprising the steps of applying an 
antigen being any one or a combination of non-capsid proteins, being other than 
VP1 to VP4, that are cleavage products of the polypeptide of Figure 2 to a horse 
and testing for an immunoreaction thereto, wherein a positive immunoreaction 
5 would indicate that said horse had been infected with ERhVl and a negative 
immunoreaction would indicate that said horse has not been infected with ERhVl. 

In another aspect, the invention provides recombinant plasmids incorporating 
nucleotide sequences and subsequences derived from the nucleotide sequences of 
Fig. 2. The recombinant plasmid may comprise the P1-2A-3C region of the 
10 ERhVl genome. 

In another aspect, the invention provides a host system characterised by 
incorporating the nucleotide sequence of Fig. 2 or part thereof. The host may be 
E.coli, vaccinia virus, baculovirus or yeast. 

In another aspect, the invention provides a process for producing a protein 
15 product derived from ERhVl comprising the steps of selecting out a gene of 
interest from the ERhVl nucleotide sequence of Fig. 2 and expressing said protein 
product in a suitable host system. 
DETAILED DESCRIPTION OF INVENTION 

The invention will now be described in detail with reference to Figs. 1 to 6: 
20 Fig. 1 (A) Schematic representation of the ERhVl genome and (B) 

comparison of the genomic structures of picomaviruses showing the predicted 
proteolytic cleavage pattern of the polyprotein. The lengths of individual regi ns 
are drawn approximately to scale. The dashed line represents the unsequenced 
region of the ERhVl 5'-NTR. 
25 Fig. 2a Nucleotide and predicted amino acid sequence of the ERhVl 

polyprotein. The nucleotide sequences of the 3'-NTR and part of the 5'-NTR are 
also shown. Numbering is from the first ATG codon that occurs in a context 
optimal for translational initiation (Kozak, 1989). A polypyrimidine tract upstream 
of the putative initiating ATG and the two pairs of in-frame ATG codons are 
30 underlined. The predicted proteolytic cleavage sites are indicated by arrows. 
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Fig. 2b Nucleotide sequence of the ERhVl 5-nontransiated region. 
The polyC tract (dotted underline), polypyrimidine tract (underline) and potential 
initiation codons (double underline) are indicated. Predicted coding sequence is 
shown in bold type. Numbering is from the ATG considered most likely to be used 
5 for translation initiation. 

Fig. 3 Alignment of the predicted amino acid sequences of 

ERhV 1.3 93/76 and FMDV.OIK polyprotein. Proteolytic cleavage sites, which are 
predicted in the case of ERhV 1, are indicated by the arrows. Identical residues (*), 
highly conserved residues (:), and less conserved residues (.), are indicated. 
10 Fig. 4 Unrooted phylogenetic trees inferred using the picornavirus 

nucleotide sequences of (A) the complete polyprotein gene, (B) the polymerase 
gene and (C) VP1 gene of viruses representing the five recognised genera of the 
family Picomaviridae. The viruses used were: 

FMDV.A10, FMDV.OIK, FMDV.AI2, FMDV.C3, FMDV.SAT3, EMCV, TMEV, 
15 Mengovirus, poliovirus LMahoney (Polio 1), poliovims 2.Sabin (Polio 2), 
poliovirus 3. Leon (Polio 2), coxsackievirus A9 (CV.A9), CV.B3, echovinis 22 
(Echo 22), swine vesicular disease virus (SVDV), bovine enterovirus (BEV) 
hepatitis A virus (HAV) human rhinovirus IB (HRV1B), HRV89 and HRV14. 
Note: The branch lengths represent proportionate change only within each 

20 tree; they do not allow direct comparisons to be made between the 

three trees. 

Fig. 5(A) Diagram outlining the strategy for nested, reverse transcription- 
polymerase chain reaction (RT-PCR) for the detection of ERhV genome. The 
genome structure of ERhV 1 is shown schematically (top), and the first round PCR 
25 product (362bp), corresponding to VP1 and 2A regions, and the second round PCR 
product (210bp), corresponding to part of VP 1, are represented as black lines. 

(B) the sequence of specific oligonucleotide primers used for RT-PCR are 
shown. VPIR1 was used for the RT reaction. 

Fig. 6 Construction of ERhV I expression plasmid for E. coli and 

30 baculovirus transfer vector for insect cells. The ERhVl genome is shown (top) and 
oligonucleotide primers used to amplify PI. 2 A and 3C regions are depicted as 
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arrows. The PI. 2 A fragment and subsequently the Pl.2A.3C fragment, obtained 
through the ligation of PI. 2 A and 3C, were cloned separately into the multiple 
cloning sites of the pET15b and pBacbluIII plasmid vectors to construct pET.P1.2A 
and pET.P1.2A.3C respectively for expression in E. coli and pBac.Pl.2A and 
pBac.P1.2A.3C respectively for expression in insect cells. 

The sequence of specific oligonucleotide primers used for the construction 
of expression plasmids are: 

VP4F 5' GCTGGATCCATGAGTGGCGGAGGTACATCCACT 3* 
R2A 5' GCTCTGCAGCAGGTCTGCTGATGCTTTGGA 3' 
3CF 5' GCTCTGCAGATGATTAGGACTGAGACTGGTGT 3' 
3CR 5' GCTGGATCCTTAGCCATAGTCAGGTTTGAA 3' 
Virus growth and purification 

ERhVl strain 393/76 was isolated from a nasal swab taken from a 
thoroughbred horse in South Australia while it was being held in quarantine 
following importation from the United Kingdom. The mare had an acute, systemic 
febrile illness. The virus was passaged 14 times in equine fetal kidney (EFK) 
monolayer cell cultures and then once in Vero cells. ERhVl virions were purified 
by a modification of the procedure described by Abraham and Colonno. Cells were 
harvested 48 hours after infection. The infected cells and supernatant fluid were 
frozen and thawed three times and clarified by centrifuging at 2,000 x g for 20 min 
at 4 C. Polyethylene glycol 6000 and NaCl were added to the supernatant to final 
concentrations of 7% and 380 mM, respectively, and the mixture was stirred 
overnight at 4 C. The precipitated virions were recovered by centrifuging at 10,000 
x g for 15 min at 4 C and rcsuspended in 200-400 p.1 TNE buffer (10 mM Tris-HCI 
pH 8.0, 100 mM NaCl, 1 mM EDTA) containing 1% NP40. The suspension was 
clarified by centrifuging at 12,000 x g for 3 min before layering onto 15% to 45% 
(wt/vol) linear sucrose gradients (35 ml) in TNE buffer and centrifuging at 100,000 
x g for 4 h at 4 C. Gradients were fractionated and the fractions analyzed by 
SDS-PAGE. Viral fractions were pooled, centrifuged at 200,000 x g for 2 h at 4 
C, and the viral pellet was resuspended in a small volume of TNE buffer, cDNA 
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synthesis and cloning. Viral RNA was reverse transcribed using an oligo-dT primer 
(Amersham) or ERhVl specific primers 

PI (5'-ATCCAGCAAGCCGCTGTCCGGTTAC-3') and P5 
(5'-CGAAGAGACACCTGCTTC-3'). Viral RNA was prepared as described in 
(1987) Anal-Biochem. 162, 156-159. 

Viral RNA and 1 00 pmol of primer were mixed, boiled for 2 min and cooled 
at room temperature. First strand cDNA was synthesized using 200 U of Maloney 
murine leukemia vims reverse transcriptase (Promega) in the presence of 0.8 raM 
dNTPs and 30 U of human placental RNAse inhibitor (Pharmacia) in a reaction 
volume of 25 ul. Second strand cDNA was synthesized using a cDNA synthesis 
kit (Amersham). The cDNA fragments were ligated into pUCl8, either as blunt 
ended fragments or after ligating BamH I adaptors (Pharmacia), and the lighted 
products used to transform E. coli strain DH5a (Stratagene). Colonies were 
selected by hybridization, initially with an [32PJ-dCTP-labelled cDNA probe 
derived from reverse transcribed viral RNA, and subsequently with 
[32P]-dCTP-labelled cloned viral cDNA (16). The sequence between two cDNA 
clones was obtained using the oligonucleotide primers P6 
( 5 ' - T T C T G G T G G A G A A G T G A C A G C - 3 ' ) and P7 
(5'-GTGAGCCAGCAACAATTGC-3') in a polymerase chain reaction (PCR; 17) 
using the polymerase, Vent Exo+- (New England Biolabs). 

DNA sequencing and analyses 

Double-stranded DNA was prepared using the alkaline lysis method and 
sequenced by dideoxy chain termination using modified T7 DNA polymerase 
(Pharmacia) and [35S]-dATP (Amersham). Sequence was read and analyzed using 
the GeneWorks software package (IntelliGenetics, Mountain View, CA). The 
GenBank database was searched using the FASTA searching and comparison 
program. The protein alignment shown in Fig. 3 was performed using the Genetics 
Computer Group, Inc. (Madison, Wisconsin, USA, 1994) GAP program with a gap 
creation penalty (GCP) of 3.0 and a gap extension penalty (GEP) of 0.1. The 
multiple alignments of nucleotide sequences were performed using ClustalW. For 
pairwise alignments the slow method was used with a GCP of 10 and a GEP of 0. 1. 
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For multiple alignments a GCP of 10 and a GEP of 0.05 was used, with alignment 
of sequences which were more than 60% divergent delayed and using weighted 
transitions. Phylogenetic relationships were examined using the maximum 
likelihood method with the DNAML program of the Phylogeny Inference Package 
5 (Phylip) version 3.5c (1993. J. Felsenstein. Department of Genetics, University of 
Washington, Seattle). The model used allowed for unequal expected frequencies 
of the four nucleotides, with the frequencies determined empirically from those 
present in the sequences analysed, and unequal rates of transitions and transversions. 
A single rate of change was assumed for all sites. The program was allowed to 
1 0 perform global rearrangements to optimise the tree. Initial analyses were performed 
on polymerase sequences using a range of transition/transversion ratios to determine 
that which gave the maximal log likelihood. A ratio of 2.0 gave the maximal log 
likelihood and thus this ratio was used for all subsequent analyses of other 
sequences. 

15 Cloning And Sequencing Of The ERHV1 Genome 

Sixty seven overlapping cDNA clones and one PCR product clone were 
obtained and sequenced from both ends. The nucleotide in each position was 
determined at least twice, and 95% of the sequence was obtained by sequencing in 
both directions. The predicted genomic structure of ERhVl was characteristic of 
20 picornaviruses, possessing one long open reading frame (ORF) flanked by 5'- and 
3'-NTR's (Fig. 1). 

The nucleotide and predicted amino acid sequences of the ERhVI 
polyprotein are shown in Fig. 2a. Partial sequence of the 5'-NTR (433 bases) was 
also obtained Fig. 2b. There was a tract of 9 Cs at position -550 to -542. PolyC 

25 tracts of various lengths have been observed in similar locations in FMDV and 
EMCV. The actual length of the ERhVl polyC tract is uncertain as these 
sequences are known to be unstable when propagated in E. coli. A 14 nucleotide 
polypyrimidine tract, which possessed the TTTC motif common to all 
picornaviruses, was present near the potential translation initiation codons. A 

30 region of 450 nucleotides upstream of the most likely initiation codon is predicted 
to contain an internal ribosorne entry site (IRES). This region showed most 
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sequence identity (48-50%) with corresponding sequences in FMDV and EMCV. 
The 3'-NTR of ERhVl was 102 nucleotides excluding the polyA tail (data not 
shown). 

In picornavinises. there are two factors thai influence which ATG codon 
initiates translation, a requirement for the ATG to be located at the 3'-end of the 
IRES, and that this ATG occurs in a sequence optimal for initiating translation, that 
is, a purine at position -3 and a G in position +4. Two pairs of in-firame ATG 
codons were identified in the ERhVl genome. The second ATG of the first pair 
is separated by 25 nucleotides from the beginning of the polypyrimidine tract (Fig. 
2b), similar to the distance (25 to 27 nucleotides) found in the corresponding 
regions in FMDV and EMCV (24). The second ATG of each pair occurs in an 
optimal context Therefore, the second ATG of the first pair is most likely to be 
the translation initiation codon but it is possible that translation is also initiated 
from the second optimal ATG, by a process of leaky scanning, or even from the 
other two, non-optimal ATG codons. The predicted ERhVl coding sequence, 
beginning at the most likely initiation ATG, extended for 6,741 bases and would 
encode a polyprotein of 2,247 amino acids. 

Alignment of the ERhVl amino acid sequence with those of other 
picornaviruses showed that it was most similar to aphthoviruses and, to a lesser 
extent, to cardioviruses in all regions of the genome (data not shown). Fig. 3 
shows a comparison of the predicted amino acid sequence of ERhVl with that of 
FMDV.OIK. The two sequences were 40% identical. The more conserved regions 
include: the 3D/polymerase (50% identity), VP4 (49% identity) and some regions 
of the 2C protein. ERhVl encoded a 2A protein of 16 amino acids, 14 of which 
were identical with those of FMDV 2A. ERhVl possessed only one copy of the 
VPg sequence. This is in contrast to FMDV which has 3 tandemly repeated, 
non-identical VPg sequences (27-29), 

Table 1 shows the proteolytic cleavage sites of ERhVl predicted from the 
amino acid alignment (Fig. 3), and compares these with those of FMDV, EMCV 
and Theilers murine encephalomyelitis virus (TMEV). Most of the ERhVl 
cleavage sites could be assigned with reasonable confidence because of significant 
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amino acid similarity with FMDV in the regions flanking the predicted cleavage 
site; an exception was the 3A/3B cleavage site where there was less sequence 
similarity. As is the case with FMDV. the predicted ERhVl 3C protease cleavage 
sites were more variable than those of the cardioviruses. EMCV and TMEV. 

Table 1. Comparison of the predicted proteolytic cleavage sites of the ERhVl 
polyprotein with those of FMDV. EMCV and TMEV. 



Cleavage sites* 

Proteins 

ERhVl FMDV EMCV TMEV 



Leader/IA(VP4) S/G K/G Q/G Q/G 

1A(VP4)/1B(VP2) A/D A/D A/D UD 

1B(VP2)/1C(VP3) E/A E/G Q/S Q/S 

1C(VP3)/1D(VP1) Q/V E/T Q/G Q/G 

1D(VP1)/2A T/N L/N E/S E/N 

2A/2B NPG/P NPG/P NPG/P NPG/P 

2B/2C Q/V Q/L Q/S Q/G 

2C/3A Q/S Q/I Q/G Q/S 

3A/3B Q/S E/G Q/G Q/G 

3B/3C E/T E/S Q/G Q/G 

3C/3D Q/G E/G Q/G Q/G 



* Cleavage data from: FMDV.OIK (Forss era/. 1984), TMEV (Pevear e/_/. 1987) 
and EMCV (Palmenberg et al. 1984). The single amino acid code is used. 

Phylogeoetic analyses 

A phylogenetic tree was derived from the nucleotide sequences of complete 
picornavirus polyproteins (Fig. 4a). Each branch of this tree was statistically, 
highly significant (PO.01), with the 95% confidence limits ranging from ±7% to 
±15% of branch lengths. ERhVl was found to be most closely related to the 
aphthoviruses, although it was clear that ERhVl was considerably more distant 
from individual members of this genus than the aphthoviruses were from each 
other. A phylogenetic tree was also derived from the nucleotide sequences of 
picornavirus polymerase genes (Fig. 4b). Each branch of this tree was statistically, 
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highly significant (PO.OI) with 95% confidence limits ranging from ±14% to 
±38% of the branch lengths. Again, ERhVl grouped with the aphthoviruses and 
the topology of the tree was the same as that obtained using data of the entire 
polyprotein (Fig. 4a). The VP I nucleotide sequences were also similarly analyzed 
(Fig. 4c). Most branches were statistically, highly significant (P<0.01). although, 
that between the ERhVl branch point and the branch point for the echovirus 
22-hepatovirus cluster was less so (PO.05). The 95% confidence limits of the 
branch lengths of this tree were considerably greater than for the other two trees, 
ranging from ±18% to ±69%. This tree did not group ERhVl with the 
aphthoviruses. With the exception of bovine enterovirus (BEV), the tree had the 
same topology as those derived from the complete polyprotein and the polymerase 
sequences. It was also apparent that picomaviruses formed three clusters: 
enteroviruses-rhinoviruses, echovirus 22-hepatovirus and 
cardioviruses-aphthoviruses-ERhV 1 . 
(1) Diagnostic reagents 

Oligonucleotide primers: We have designed short oligonucleotide 

primers and used them in polymerase chain reactions (PCR) for the diagnosis of 
ERhV infected horses. Any of the ERhV nucleotide sequence may be used for the 
'design primer sets for use as diagnostic reagents. They may be highly specific for 
ERhVl or they may be designed to be more cross reactive so as to amplify single 
strand RNA template from other ERhV types e.g., ERhV 2, 3 and 4. As a specific 
example we have used the primer set shown in Fig. 5 to diagnose ERhV disease in 
several groups of seriously ill horses in circumstances in which, despite exhaustive 
efforts, we could not isolate the virus using conventional cell culture procedures. 
We now consider ERhV a very under reported disease simply because, most of the 
time, nasal samples collected from horses experiencing severe, systemic clinical 
disease because of ERhV infection do not yield the virus in cell culture. In one 
particular group of horses, we detected the presence of ERhV by PCR and 
confirmed that the horses were both actively infected and seriously ill with ERhV 
by use of paired serum samples which showed that there was a concomitant rise in 
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ERhVl serum neutralising antibody. Vigorous attempts to isolate the virus in cell 
cultures yielded negative results. 

Oligonucleotide probes: Virus specific oligonucleotides are used as probes 
to detect the presence of the virus in infected samples from diseased horses and 
other animals. This may be especially important given the systemic nature of the 
illness i.e., it is a foot-and-mouth-like, generalized disease with virus distributed 
throughout the body in many organs and tissues; it is not just a simple "common 
cold-like" illness as the name rhinovirus implies The significance of the sequence 
in moving the virus out of the Rhinovirus genus and into a new genus proposed to 
be called " Equirhinovirus" in the Picornaviridae family does not represent merely 
a taxonomic change but represents a paradigm shift in how ERhVl and related 
viruses must now be regarded as pathogens for the horse and other animal species. 

Diagnostic antigens: Individual virion proteins, in particular VP 1 f VP2 
and VP3, can be expressed in any one of a number of heterologous expression 
systems to provide antigens to detect specific antibody to ERhVl present in blood. 
Such expression systems, which are well established for E. coli, yeast, vaccinia 
virus and baculovirus, allow for the production of large quantities of protein to a 
high degree of purity. The expressed virion proteins may be used in simple 
immunoassays, such as ELISA, to detect ERhVl specific antibody. Virion proteins 
expressed in this way also serve as effective vaccines against ERhVl disease. 
(2) Vaccines 

Production of virus like particles (VLPs): We have used the sequence 
information to construct recombinant plasmids containing the P1-2A-3C region of 
the genome (see Fig. la and Fig. 6). These plasmid constructions are of course 
critically dependent on the ERhVl sequence that has been determined although the 
strategy that we are adopting, in general, is similar to that described in J. Virol 66, 
4557-4564. Some early plasmid constructions have been inserted into £. coli and 
baculovirus expression systems based on prior art with similar viruses such as 
poliomyelitis of humans and foot-and-mouth disease virus of cattle and other cloven 
hoofed animals. The RT PCR double stranded DNA of the P1-2A-3C region of the 
ERhVl genome is transcribed, within the transformed £. coli or insect cell for 
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baculovirus, into messenger RNA as a single transcript which is then translated into 
a mini polyprotein. The 3C protease activity results in the cleavage of the mini 
polyprotein into its constituent parts namely 1A (VP4), IB (VP2). 1C (VP3) and 
1D(VP1), 2A and 3C (see Fig. la and Fig. 6) and that the VP component parts then 
5 self assemble into VLPs i.e., virus particles that lack nucleic acid and are therefore 
non infectious i.e., are unable to cause disease. Two important applications of 
ERhV VLPs are as follows: 

(a) The VLPs are very useful as highly effective, safe, high antigen-mass 
vaccines for the control ERhVl disease. If ERhVl disease is confirmed, as we 
10 believe to be the case, as significant and responsible for much hither to undiagnosed 
illness that results in many lost training days, many expensive treatments, much 
serious illness because of secondary infections following on the primary ERhVl 
infection, and much poor performance, then the utility of the vaccine based on the 
VLPs that are the subject of this invention will be very great and likely to have 
15 world-wide application. 

With improved methods for the diagnosis of ERhVl infection such as by 
PCR and ELISA as described herein, it is likely that other members of the proposed 
new Equirhinovirus genus within the family Picornaviridae including for example 
ERhV2, ERhV3, may be similarly diagnosed. Indeed suitably selected PCR primer 
20 sets based on the ERhVl sequence could be used to detect these other equine 
rhinoviruses. The sequencing of these genomes could provide a basis for their 
specific diagnosis. It is also evident that the construction of VLP's based on 
expression plasmids similar to those described herein for ERhVl, could be readily 
adapted to these other equine rhinoviruses leading for example to production of 
25 combined ERhV vaccines to cover all antigenic types as may be extant or as may 
emerge by antigenic variation, as is very much a part of the biology of FMDV, in 
the future. Polyvalent VLP vaccines incorporating a range of ERhV antigenic types 
are obvious extensions based on the work described herein. 

(b) ERhV VLPs can be used as a delivery vector that will provide not only 
30 protection against ERhV disease but will be used to deliver other therapeutic and 
useful substances to the horses following administration by parenteral or other 
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routes. Such delivery vectors can be produced by inserting into, for example the 
PI region at some appropriate site, double stranded DNA coding for antigenic 
epitopes of other virus and infectious agents of horse as well as epitopes derived 
from other non infectious sources for example reproductive hormones. 



ERhVl DIAGNOSTIC TESTS 

For the detection of ERhVl antibodies in infected or vaccinated horses 
various standard tests can be used. VLP's may be used in such tests for example 
in an ELISA test for antibody. 

Other diagnostic tests based on recombinant antigens derived from the 

ERhV 1 sequence can be devised along similar lines to those reported for FMDV 

in which the absence of protein 2C from clarified inactivated whole virus FMD, 

FMDV or FMDV VLP vaccines maybe used as the basis for distinguishing infected 

from vaccinated animals where the vaccine is a non-replicating form of ERhVl or 

a deletion mutant of ERhVl in which a particular non-structural protein gene has 

been deleted- Precedent for this comes from studies of FMDV as reported in for 

example Lubroth, Grubman, Burrage, Newman & Brown, 1996, Absence of protein 

2C from clarified foot-and-mouth disease virus vaccines provides the basis for 

distinguishing convalescent from vaccinated animals, Vaccine 14(5), 419-427. 

PREPARATION AND USE OF VIRUS-LIKE PARTICLES AND OTHER 
PROTEINS BASED ON ERhVl SEQUENCE 

From the sequence of ERhVl it is possible to clone certain segments of the 

viral genome into a variety of vectors for expression in a variety of different 

expression systems. There is a straight forward and strong literature for FMDV that 

provides a very clear precedent for what can be done for ERhV I. Examples 

include the expression of FMDV P1-2A in a baculovinis (Abrams CC & Belsham 

GJ, 1994, The antigenicity of foot-and-mouth disease virus P1-2A polyprotein and 

empty capsids produced in vaccinia virus and baculovirus expression systems. In 

Vllth Meeting of the European Study Group on the Molecular Biology of 

Picornaviruses, 6-1 1 August 1994, Korpilampi, Finland) or vaccina virus syst ms 

(Abrams CC, King AMQ & Belsham GJ, 1995, Assembly of foot-and-mouth 
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disease virus empty capsides synthesized by a vaccinia virus expression system. 
Journal of General Virology 76:3089-3098) to obtain VLPs or viral proteins. We 
have prepared similar plasmids in which P1-2A. P1-2A-3C and these two sequences 
in a myristolated form have been inserted into p fastbac I baculovirus vector 
5 (Gibco/BRL) and into a PET vector (Novogene) for expression in insect cells and 
E.coli respectively. 

These expressed products either as protein antigens or as VLPs, have utility 
as the basis for diagnostic tests or vaccines. 

Accordingly, such references are herein incorporated in support of the full 
10 description and enablement of the invention where the disclosed methods of 
preparing diagnostics, vaccines, vectors, host systems and kits are fully described 
and applicable to the like aspects of the current invention. 
(3) Applications in human medicine: 

ERhV is also a human pathogen. We have unpublished data to confirm that 
15 humans have serum neutralising antibody to ERhVl that is indicative of infection. 
One of the laboratory workers concerned with the conduct of the sequencing and 
who handled infectious vims has specific antibody in high amounts (serum 
neutralising antibody titre 1 : 640 to ERhVl). We are currently extending these 
studies and anticipate finding a significant incidence of infection in humans world 
20 wide particularly among those humans who work with horses. The improved 
diagnostic methods outlined above, perhaps also the vaccine, are expected to have 
application in human medicine. 



SUBSTITUTE SHEET (Rule 26) 



PCT/AU96/00815 

WO 97/22701 

27 



CLAIMS: 



1. A substantially pure nucleotide sequence for ERhVl being: 




CCGTCAAGCC 


CGTTGCCTGT 


ATAGCCAGGT 


AAC CGGACAG 


CGGCTTGCTG 


GATTTTCCCG 


• "l 1 c 


GTGCCATTGC 


TCTGGATGGT 


GTCACCAAGC 


TGACAAATGC 


GGAGTGAACC 


TCACAAAGCG 


* Jl3 


ACACGCCTGT 


GGTAGCGCTG 


CCCAAAAGGG 


AGCGGAACTC 


CCCGCCGAGG 


CGGTCCTCTC 


_ O C C 


TGGCCAAAAG 


CCCAGCGTTG 


ATAGCG CCTT 


TTGGGATGCA 


GGAACCCCAC 


CTGCCAGGTG 


— IOC 


TGAAGTGGAG 


TGAGCGGATC 


TCCAATTTGG 


TCTGTTCTGA 


ACTACACCAT 


TTACTGCTGT 


-TIC 


GAAGAATGCC 


CTGGAGGCAA 


GCTGGTTACA 


GCCCTGACCA 


GGCCCTGCCC 


GTGACTCTCG 


/ -3 


ACCGGCGCAG 


GGTCAAAAAT 


TGTCTAAGCA 


GCAGCAGGAA 


CGCGGGAGCG 


TTTCTTTTCC 




TTTTGTACTG ACATGATGGC 


GGCGTCTAAG 


GTGTATAGAG 


TTTGCGAGCA 


GACTCTGCTG 


45 


cimnrrrnrrn ttcgcatgat GGACAAATTC 


TTGCAAAAGA 


GAACTGTTTT 


TGTCCCCCAT 


105 


rTiT A C AAAA 
L X X ^a#%\ rwWw^ 


PAATTPCTTT 


GACTGGACTC 


CACAATTATG 


ACAATACTTG 


CTGGTTGAAT 


IBS 




AACTGACACA 


GATTCTTGGA 


ATTCGGCTTT 


TTGATGAACA 


CTTCGGCAAT 


x2d 




TCACTCGGAA 


AACAATTGAT 


TGGGTGAGTG 


ACCAGACTGG 


TATAAAAGAT 


*> a c 
285 


CTAAAATLAw 


n.f;f~A rffi r*c* 


ACTCGTGGTG 


GTGTACAAAC 


TGTGGCAACA 


TGGACACTTG 


-% A CL 


kjJVxvx x Luu 1A 




ACCCCGGTCG 


ATTACTCTAT 


GGTCTGGCCC 


CAAAGTGTGT 


a n c 


Li 1 iLi IjAI 1 


X L * Www^W X 


TGTTTCGGCA AAACCGGGAC 


ATGCAGTATT 


CTACCTTCTC 


A c c 




U» X luuKl L»lvjr 


X w X X X \ 


AAGAAAATAT 


ACCCAGAAAC 


ACCCAAAACA 


r i tr 

525 


<»»Avj<*H. x t» IAt 




L>LLL X #\X UH.V. 


TTTGAGTCAC 


TGGGCAAGGA 


CCCACCAAAG 


rnr 

5fl 5 


L X ALAL LAUA 


fTATAXtTA AAA 


AGCATTTGAG 


CTCAGTGGCG 


GAGGTACATC 


CACTCCAACA 




ACTGGCAACC 




CGGAAACAGT 


GGTTCAATTG 


TTCAAAATTT 


TTACATGCAA 


705 


CAGTAC UAuA 


A2 1 UHrti 


CGCAGACCTG 


GGAGACAATG 


TGATTAGCCC 


TGAAGGCCAG 




WjUAuLAAUi 


L X A Av XXL 


AACCTCATCA 


AGCCAATCCT 


CTGGCTTGGG 


CGGGTGGTTC 


one 


TCXAGTTTGC 




AACAAAACTA 


CTGGCTGACA 


AGAAGACAGA 


AGAGACTACA 


B H S 


AACATTGAAG 


ALAUAAi IWi 


AACAACAGTG 


GTTGGAGTCA 


CTATTATTAA 


TTCACAAGGA 


545 


TCTGTTTGGAA 


VvAnL V_ X >t>L X w 


TTACTCCAAA 


CCGGATGGTA 


GACCACCATC 


CACAGTGTCA 


LOO 5 


GACCCAGTTA 


CCAGACTTGG 


ACCCACGCTT 


TCCAGGCACT 


ACACATTTAA 


GGTAGGTGAG 




TGGCCCCATT 


CTCAATCACA 


TGGTCACGCA 


TGGATCTGTC 


CGTTGCCAGG 


TGACAAACTC 




AAGAAGATGG 


GCAGTTTTCA 


TGAGGTTGTC 


AAAGCCCACC 


ACCTGGTCAA 


GAACGGCTGG 


1X85 


GATGTGGTTG 


TGCAGGTGAA 


TCCCTCATTT 


GCTCACTCCG 


GGC CGCTGTG 


TGTAGCAGCA 


1 "5 A C 
1x43 


GTGCCGGAGT 


ACGAACACAC 


ACATGAGAAA 


GCACTCAAGT 


GGTCTGAGCT 


TGAGGAACCA 


1 JOa 


GCTTACACAT 


ACCAACAACT 


TTCAGTTTTT 


CCCCACCAGT 


TGCTAAATTT 


GAGGACAAAT 


1365 


TCATCAGTGC 


ATTTGGTGAT 


GCCCTACATT 


GGGCCAGGCC 


AACCAACAAA 


TCTGACTTTG 


1425 


CACAACCCGT 


GGACCATTGT 


TATTTTAATT 


TTGTCTGAAT 


TGACAGGACC 


TGGCCAAACT 


1 v% O C 

14 B5 


GTGCCTGTGA 


CCATGTCGGT 


GGCTCCCATC 


GATGCAATGG 


TTAATGGGCC 


TCTTCCAAAT 


•y C A C 

1545 


CCAGAGGCAC 


CGATTAGAGT 


GGTGTCTGTG 


CCTGAATCAG 


ATTCTTTTAT 


GTCTTCAGTA 


16 Ob 


CCTGATAATT 


CGACTCCACT 


ATACCCCAAG 


GTTGTGGTCC 


CACCGCGCCA 


AGTTCCTGGC 


1665 


CGGTTTACAA 


ATTTCATTGA 


TGTGGCAAAA 


CAGACATATT 


CAT TIT G TT C 


CATTTCTGGA 


1725 


AAACCTTATT 


TTGAGGTTAC 


CAACACCTCT 


GGGGACGAGC 


CACTGTTTCA 


GATGGATGTG 


1785 


TCGCTCAGTG 


CGGCAGAGCT 


ACATGGCACT 


TACGTAGCTA 


GTTTGTCATC 


AriTlTlXJCA 


184S 


CAGTACAGAG 


GCTCACTTAA 


TTTCAACTTT 


ATTTTCACTG 


GTGCAGCAGC 


CACTAAGGCA 


1905 


AAGTTTCTGG 


TTGCTTTTGT 


GCCTCCCCAC 


AGTGCAGCGC 


CCAAAACGCG 


CGATGAAGCA 


1965 


ATGGCGTGCA 


TCCATGCCGT 


GTGGGATGTT 


GGCTTGAACT 


CAGCTTTTTC 


TTTTAATGTA 


2025 


C CTTATCCCT 


CCCCTGCTGA 


CTTCATGGCC 


GTTTATTCTG 


CGGAACGGAC 


GGTTGTGAAT 


2085 


GTCTCTGGAT 


GGCTTCAAGT 


TTATGCACTA 


ACAGCTCTAA 


CTTCAACTGA 


CATTG CCGTG 


2X45 


AACAGTAAAG 


GCCGTGTGCT 


GGTTGCTGTT 


TCCGCCGGCC 


CAGACTTCTC 


CCTTCGTCAC 


2205 
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CCGGCGGACC TGCCCGACAA G CAGGTT AC C AATGTGGGAG AGGATGGTGA ACCCGGTGAG 226 5 
ACAGAGCCTC GTCATG CTTT GTCACCCGTG GACATGCACG TGCACACAGA TGTCAGTTTC 23 2 5 
TTGCTTGACC GGTTCTTTGA TGTTGAGACA CTTGAGCTTT CAAATTTGAC AGGTTCTCCT 23 BS 

GCCACACATG TTCTGGATCC GTTTGGCTCG ACTGCCCAAC TGGCTTGGGC ACGTCTGCTA 2 44 5 

AACACTTGCA CCTACTTCTT TTCTGATTTG GAATTGTCAA TCCAGTTTAA ATTTACCACC 2 505 

ACTCCGTCCT CTGTTGGAGA GGGCTTTGTG TGGGTGAAGT GGCTCCCTGT TGGAGCACCA 2565 

ACCAAGACCA CAGATG CTTG GCAGTTAGAA GGAGGTGGAA ATTCAGTTAG AATTCAAAAA 262 5 

TTGGCCGTTG CAGGGATGTG CCCCACTGTT GTGTTCAAGA TTGCAGG CTC CCGTTCACAA 2 6 85 

GCCTGTGCTT CAGCGTTGCC ATATACATCA ATGTGGCGTG TTGTG CCAGT CTTTTACAAT 274 5 

GGCTGGGGTG CACCTACCAA AGAAAAGGCA ACCTACAATT GGCTTCCTGG TGCACACTTT 28 05 

GGTTCCATCT TGCTGACTTC TGATGCGCAT GATAAAGGAG GGTGCTACTT GCGGTATGCT 28 65 

TTCCGCGCGC CAGCGATGTA TTGCCCTCGA CCCATTCCGC CGGCTTTTAC GCGTCCAGCG 292 5 

GACAAAACCA GACATAAATT TCCCACTAAC ATCAACAAAC AGTGTACTAA TTACTCTCTC 2985 

CTCAAATTGG CTGGAGATGT TGAGAGCAAC CCTGGCCCCA CTATTTTTTC CAAAGCATCA 304 5 

GCAGACCTGA ATGCCTTGTC AACGTCGCTA GGTGAATTGA CTGGCATGCT AAAAGATCTT 310 5 

AAAGCCAAGG CAGAAACTTA TTCCCCGTTT TACAAAATGG CCAAAATGCT TTTCAAACTT 316 5 

GCAACACTAG CTGTGGCAGC TATGAGGACA AAGGACCCAG TAGTGGTGGT TATGTTGATT 322 5 

GCTGATTTCG GATTGGAGGT CTTTGACACT GGGTTTTTCT TTTC CTACTT TCAAGAGAAG 3 28S 

TTGCAGCCTT ATATGAAAAC TATTCCTGGT AAGATTTCTG ATTTGGTCAC TGATG CGGCT 334S 

ACGGCTGCCG CCCAAATTCC AAAGGGAGTG TATTCTTTTG TGTCGTCATT TTTCGAAACG 34 05 

CCTGAAGGAG TGGTTGAGAA GCAGGTGTCT CTTCGGACAG TGAATGACAT ATTTGCTTTG 3465 

CTTAAAAATT CTGATTGGTT CATAAAGACT CTTGTTGCCC TCAAGAAATG GCTGACATCC 352 S 

TGGTTTGCTC AAGAACAACA GGCAGATGAT GCGCTCTATT CAGAATTGGA AAAATXTCCC 3 585 

TTGTACAAGT TAAAATTGAA GGAACCTGAT ACTCAAGAGG AAGCGCGCCA GTGGTTTAAA 3645 

GACATGCAGC AGCGTGCTCT CGCTGTGAAG GACAAAGGTC TCTTTTCCCT CCTGCAAATT 3 705 

CCATTAGTTA ACTTGCCCCA GAGCCGTCCA GAGCCCGTTG TATGCGTCCT TCGGGGCGCA 3765 

TCAGGGCAAG GCAAATCTTA TTTGGCAAAT CTGATGG CTC AAGCAATTTC GCTTCTCTTG 3 825 

GTTGGCAAGC AGGACAGTGT GTGGAGTTGT CCTCCTGACC CCACATATTT TGATGGCTAT 3 885 

AACGGACAGG CTGTGGTGAT TATGGATGCA TTGGGCCAGG ATCCGAATGG TGCTGACTTT 3945 

AAATATTTTT GCCAGATGGT CTCTACAACA GCTTTTGTAC CACCTATGGC CCATTTGGAT 4005 

G ATAAAGG CA TTC CATTT AC TTCTC CTGTT GTTATTTGTA CTACAAATTT GCATTCATCT 4065 

TTTACCCCTA TTACTGTTTC TTGT CCTGAA GCTCTTAAGA GGAGGTTTCG GTTTGATGTG 4125 

ACGGTGTCCG CTAAACCGGG CTTTGTGCGC ACTGTTGGTT CAAACCAGCT TTTGAATCTC 418 5 

CCACTTGCTC TTAAGCCAGC TGGT CTTCCC CCACACCCTA TCTTTGAAAA TGACATGCCC 424S 

ATTATAAATG GGCAGGCTGT TAAATTGGCT CTTTCTG GTG GAGAAGTGAC AGCT TTTGAG 4305 

CTTATTGAGA TGATACTGTC AGAAGTTCAA AACAGACAAG ACACACACAA AATGC CCATT 4 365 

TTTAAACAAT CATGGTCTGA TTTGTTCAGA AAGTGTACAA CTGATGAGGA ACAGAAAATG 4425 

TTGCA GTTTT TAATTGACAA TAAAGATTCA GAAATTCTCA GGGCGTTTGT TTCAGAACGC 448 5 

TCCATTTTAC TACATGAAGA GT AT CTTAAA TGGGAGT CAT ATATGACCAG GAGAGCCAAG 4 545 

TTTCACCGCC TGGCTGCTGA TTTTGCTATG TTTCTATCCA TTCTTACTTC ACTGATTGTT 4605 

ATTTTTTGTT TAGTTTATTC TATGTATCAA CTTTTTAAGA CCCCTGACGA GCAATCAGCT 4 665 

TATGATCCTT CAACTAAGCC AAAACCAAAG ACCCAGGAAG TGAAAACACT GAAGATTAGG 4 72 S 

ACTGAGACTG GTGTACCAGC AACTGACTTG CAACAATCCA TCATGAAAAA TGTTCAGCCA 478 5 

ATTGAGCTTT ACCTT GAC AA TGAATTGGTT ACTGACTGCT CTGCCTTGGG T GTTTATG AC 484 5 

AATTCATATT TGGTGCCCCT TCATTTGTTT GAATTTGATT TTGAT AC CAT TGTGCTTGGT 4 905 

GGACGTCATT ACAAGAAAGC TGAGTGTGAG AAGGTAGAGT TTGAGCTTGA AG TGAATGG A 4 965 

GACGTGGTGT CATCAGATGC GTGTCTACTT CGAGTGTCAT CGGGGCCTAA AGTTAGAAAT 5025 



cTiDcrrrn rpR SHEET (Rv 1 - w 



WO 97/22701 



PCT/AU96/00815 



29 



ATTG'ITCATC 


1 11 lTACAAA 


TGAAATTGAA 


TTGAAGAAAA 


TGACCCAAGT 


GACAGGAATC 


5085 


ATGAATTCAC 


CACACCAGGC 


ACGCACTGTG 


TTTTTTGGCA 


GTTTITI'GAC 


AGTGAGGAAG 


5145 


TCCATCTTAA 


CATCGGATGG 


GACTGTAATG CCCAATGTTT 


TGTCCTATGC 


CGCTCAGACC 


5205 


TCGCGTGGGT 


ATTGTGGCGC 


TGCAATTGTT 


GCTGGCTCAC 


CTGCCCGCAT 


AATTGGTATC 


5265 


CATTCAGCTG 


GCACTGGATC 


TGTTGCATTT 


TGCTCCCTGG 


TGTCCAGAGA 


CGCGCTGGAG 


5325 


CAACTCTGGC 


CCCAGAAACA 


GGGCAACGTT 


AGTCG CCTTG 


ATGACGATGT 


GAGGGTGTCT 


5385 


GTTCCGCGCC 


GCTCCAAATT 


GGTGAAATCA 


TTGGCTTACC 


CCATTTTCAA 


AC CTG ACTAT 


5445 


GGCCCAGCGC 


CACTCTCTCA 


ATTTGACAAG 


CGCCTGTCAG 


ACGGCGTGAA 


GCTGGATGAA 


5505 


GTGGTTTTTG 


CTAAACATAC 


TGGAGACAAG 


GAGATTTCCG 


CACAGGACCA 


GAAATGGCTC 


S565 


TTGCGTGCGG 


CGCATGTATA 


CGCCCAGAAG 


GTTTTCTCCC 


GGATTGGATT 


TGACAACCAG 


5625 


GCTTTGACTG 


AAAAAGAGGC 


CATTTGTGGC 


ATTC CTGGCC 


TTGACAAGAT 


GGAGCAGGAC 


5685 


ACCGCTCCCG 


GGCTGCCCTA 


TGCTCAGCAA 


AATAAGAGAA 


GGAAAGACAT 


CTC5TGATTTT 


5745 


GAAGAGGGCC 


GGCTGAAGGG 


CGCCGAACTC 


CAAAAGGACA 


GATTTATGGC 


TGGTGACTAC 


5805 


TCTAATTTGG 


TCTATCAATC 


ATTTTTGAAA 


GATGAGATCC 


GCC CACTTGA 


GAAAGTTAGG 


5865 


GCTGGAAAGA 


CCCGCCTGAT 


TGACGTGCCG 


CCGATGCCCC 


ATGTGGTGGT 


TGGTAGGCAG 


5925 


CTCTTGGGCC 


GGTTTGTGGC 


AAAATTTCAT 


GAAGCAAATG 


GATTTGACAT 


TGGCTCAGCC 


598S 


ATTGGATGTG 


ACCCAGATGT 


GOACTGGACT 


CGGTTTGGCC 


TCGAGTTGGA 


GCGZTICAGG 


6045 


TATGTATATG 


CCTGTGACTA 


CTCACGGTTC 


GATGCCAACC 


ATGCAGCTGA 


TGCAATGAGA 


6105 


GTTGTGCTTA 


actacttttt 


CTCTGAGGAC 


CACGGTTTCG 


ACCCTGGTGT 


GCCTGCTVI 1 


6165 


ATTGAGTCAC 


TGGTTGATTC 


AGTGCATGCC 


TATGAAGAGA 


AAAGGTATAA 


CATCTACGGT 


6225 


GGCTTGCCAT 


CCGGGTGTTC 


CTGCACATCA 


ATTTTGAATA 


CCATCTTGAA 


CAATGTTTAC 


62S5 


ATTCTTGCAG 


CTATGATGAA 


GGCTTATGAG 


AATTTTGAGC 


CAGATGACAT 


TCAGGTCATT 


6345 


TGCTATGGGG 


ACGACTGCCT 


CATTGCTTCT 


GATTTTGAAA 


TTGATTTCCA 


AGAACTGGTG 


6405 


CCTGTCTTTT 


CTAUTT'ITGG 


ACAGGTAATA 


ACTACAGCTG 


ACAAGACTGA 


TTTTTTTAAA 


6465 


CTGACAACGC 


TTTCGGAGGT 


GACCTTCCTT 


AAGCGCGCTT 


TTGTTCTGAC 


GGCCTTTTAC 


6525 


AAGCCAGTGA 


TGGATGTGAA 


GACCCTTGAA 


GCAATCTTAA 


GCTTTGTTCG 


CCCAGGCACA 


6585 


CAGGCTGAAA 


AGCTCCTGTC 


CGTGGCGCAG 


TTGGCAGGCC 


ACTGCGAACC 


GGAGCAGTAT 


6645 


GAGCG CCTGT 


TTGAGCC CTT 


TGCTGGGATG 


TATTTCGTCC 


CTACTTGGCG 


ACTTGCGCCT 


6705 


G CAGTGGTTG 


ATGAAGCTTG 


GATGCTAAAT 


TCTTTTTGAC 


TT-ixsi-rriTC 




6765 


TTTAGGCTTT 


TAAGGTGTTA 


AGTTTAAAGG 


TTAAGAGTTT 


TTAGAAGTTA 


AGATAGAGTT 


6825 



TA GTTTTTA G TTTTGAGC-poly <A) 

as disclosed in Fig. 2 and functional equivalents of said nucleotide sequence 
including naturally occurring derivatives, variants, degeneracy equivalents and 
deletion mutants thereof. 
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2. A substantially pure amino acid sequence being: 

MAASKVYRVCE 
LQKRTVFVPHL 
TCWLKALTQLT 
RGLFTRKTIDW 
PPLVVVYKLWQ 
ITLWSGPKVCL 
VFYLLTSEGWI 
EDVLVFAPYDF 
L + VP 4 

SKAFEXiSGGGT 
GSIVQNFY MQQ 
S PEGQGSNTSS 

VP4 4 VP 2 
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VP 2 4 VP 3 
PNPEAPIRVVSV 
STPLYPKVVVPP 
AKQTYSFCS ISG 
PLFQMDVSLSAA 
FAQYRGSLNFNF 
VAFVPPHSAAPK 
DVGLKSAFS FMV 
AERTVVNVSGWL 
AVNSKGRVLVAV 

VP 3 4 VP1 
LPDKQVTMVGED 
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CLVYSMYQLFKT 

KPKTQEVKTLKI 

SIMKNVQPIELYL 
VYDNSYLVPLHLF 
HYKKAECEKVEFE 
CLX-RVSSGPKVRN 
KMTQVTG IMNS PH 
VRKS ILTSDGTVM 
GYCGAAIVAGS PA 
VAFCSLVSRDALE 

LDDDVRVSVPRRS 
PDYGPAPLSQFDK 
FAKHTGDKEISAQ 
AQKVFSRIGFDNQ 
GLDKMEQDTAPGL 
CDFEEGRLKGAE L 
LVYQSFLKDEIRP 
DVPPMPHVVVGRQ 
NGFDIGSAIGCDP 
RFRYVYACDYSRF 
LNYFFSEDHGFDP 
VHAYE EKRYNIYG 
NTILNNVYILAAM 
QVICYGDDC LIAS 
FSSFGQVITTADK 
TFLKRAFVLTAFY 
FVRPGTQAEXL 
EOYERLFEPFAGM 

3D 

VDEAWMLNS F 
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3. A protein or vims like particle incorporating VP I. derived from ERhVl and 
having the following amino acid sequence: 
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4. A protein or virus like particle incorporating VP2, derived from ERhVl and 
having the following amino acid sequence: 
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5. A protein or virus like panicle incorporating VP3. derived from ERhVl and 
having the following amino acid sequence: 

APIRVVSVPE50S FMSSVPDNSTPL 
YPKVVVPPRQVPGRFTNFIDVAKQT 
YSFCS ISGKPYFEVTNTSGDEPLFQ 
MDVSLSAAELHGTYVASLS SFFAQY 
RGSL NFNFI FTGAAATKAKFLVAFV 
PPHSAAPKTRDEAMACIHAVWDVGL 
NSAFSFNVPYPS PADFMAVYSAERT 
VVNVSGWLQVYALTALTSTDIAVNS 
KGRVLVAVSAGPDFSLRHPADLPDK 

Q 



6. A protein or virus like particle incorporating VP4, derived from ERhVl and 
having the following amino acid sequence: 



G G G T 
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T G N Q 
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7. A substantially pure nucleotide sequence for VP1 being: 




GTTACCAATG 


TGGGAGAGGA 


TGGTGAACCC 


GGTGAGACAG 


AGCCTCGTCA 


TG CTTTGTCA 


CCCGTGGACA 


TGCACGTGCA 


CACAGATGTC 


AGTTTCTTGC 


TTGACCGGTT 


CTTTGATGTT 


GAGACACTTG 


AGCTTTCAAA 


TTTGACAGGT 


TCTCCTGCCA 


CACATGTTCT 


GGATCCGTTT 


GGCTCGACTG 


CCCAACTGGC 


TTGGGCACGT 


CTGCTAAACA 


CTTGCACCTA 


CTTCTTTTCT 


GATTTGGAAT 


TGTCAATCCA 


GTTTAAATTT 


ACCACCACTC 


CGTCCTCTGT 


TGGAGAGGGC 


TTTGTGTGGG 


TGAAGTGGCT 


CCCTGTTGGA 


QCACCAACCA 


AGACCACAGA 


TGCTTGGCAG 


TTAGAAGGAG 


GTGGAAATTC 


AGTTAGAATT 


CAAAAATTGG 


CCGTTG CAGG 


GATGTGCCCC 


ACTGTTGTGT 


TCAAGATTGC 


AGGCTCCCGT 


TCACAAGCCT 


GTGCTTCAGC 


GTTGCCATAT 


ACATCAATGT 


GGCGTGTTGT 


GCCAGTCTTT 


TACAATGGCT 


GGGGTGCACC 


TACCAAAGAA 


AAGGCAACCT 


ACAATTGGCT 


TCCTGGTGCA 


CACTTTGGTT 


CCATCTTGCT 


GACTTCTGAT 


GCGCATGATA 


AAGGAGGGTG 


CTACTTGCGG 


TATGCTTTCC 


GCGCGCCAGC 


GATGTATTGC 


CCTCGACCCA 


TTCCGCCGGC 


TTTTACG CGT 


CCAGCGGACA 


AAACCAGACA 


TAAATTTCCC 


ACTAACATCA 


ACAAACAGTG 


TACT 
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and functional equivalents of said nucleotide sequence including naturally occurring 
derivatives, variants and degeneracy equivalents. 



8. A substantially pure nucleotide sequence for VP2 being: 



GACAAGAAGA 


CAGAAGAGAC 


TACAAACATT 


GAAGACAGAA 


TTGAAACAAC 


AGTGGTTGGA 


GTCACTATTA 


TTAATTCACA 


AGGATCTGTT 


GGAACAACCT 


ACTGTTACTC 


CAAACCGGAT 


GGTAGACCAC 


CATCCACAGT 


GTCAGAC CCA 


GTTACCAGAC 


TTGGACCCAC 


GCTTTCCAGG 


CACTACACAT 


TTAAGGTAGG 


TGAGTGGCCC 


CATTCTCAAT 


CACATGGTCA 


CGCATGGATC 


TGTCCGTTGC 


CAGGTGACAA 


ACTCAAGAAG 


ATGGGCAGTT 


TTCATGAGGT 


TGTCAAAGCC 


CACCACCTGG 


TCAAGAACGG 


CTGGGATGTG 


GTTGTGCAGG 


TGAATCCCTC 


ATTTGCTCAC 


TCCGGGCCGC 


TGTGTGTAGC 


AGCAGTG CCG 


GAGTACGAAC 


ACACACATGA 


GAAAGCACTC 


AAGTGGTCTG 


AGCTTGAGGA 


ACCAGCTTAC 


ACATACCAAC 


AACTTTCAGT 


TTTTCCCCAC 


CAGTTGCTAA 


ATTTGAGGAC 


AAATTCATCA 


GTGCATTTGG 


TGATGCCCTA 


CATTGGGCCA 


GGCCAACCAA 


CAAATCTGAC 


TTTGCACAAC 


CCGTGGACCA 


TTGTTATTTT 


AATTTTGTCT 


GAATTGACAG 


GACCTGGCCA 


AACTGTGCCT 


GTGACCATGT 


CGGTGGCTCC 


CATCGATGCA 


ATGGTTAATG 


GGCCTCTTCC 


AAATCCAGAG 









and fiinctional equivalents of said nucleotide sequence including naturally occurring 
derivatives, variants and degeneracy equivalents. 



9. A substantially pure nucleotide sequence for VP3 being: 



GCACCGATTA 


GAGTGGTGTC 


TGTGCCTGAA 


TCAGATTCTT 


TTATGTCTTC 


AGTACCTGAT 


AATTCGACTC 


CACTATACCC 


CAAGGTTGTG 


GTCCCACCGC 


GCCAAGTTCC 


TGGCCGGTTT 


ACAAATTTCA 


TTGATGTGGC 


AAAACAGACA 


TATTCATTTT 


GTTCCATTTC 


TGGAAAACCT 


TATTTTGAGG 


TTACCAACAC 


CTCTGGGGAC 


GAGCCACTGT 


TTCAGATGGA 


TGTGTCGCTC 


AGTGCGGCAG 


AGCTACATGG 


CACTTACGTA 


GCTAGTTTGT 


CATCATTTTT 


TGCACAGTAC 


AGAGGCTCAC 


TTAATTTCAA 


CTTTATTTTC 


ACTGGTGCAG 


CAGCCACTAA 


GGCAAAGTTT 


CTGGTTGCTT 


TTGTGCCTCC 


CCACAGTGCA 


GCGCCCAAAA 


CGCGCGATGA 


AGCAATGGCG 


TGCATCCATG 


CCGTGTGGGA 


TGTTGGCTTG 


AACTCAGCTT 


1TTCTTTTAA 


TGTACCTTAT 


CCCTCCCCTG 


CTGACTTCAT 


GGCCGTTTAT 


TCTGCGGAAC 


GGACGGTTGT 


GAATGTCTCT 


GGATGGCTTC 


AAGTTT ATG C 


ACTAACAGCT 


CTAACTTCAA 


CTGACATTGC 


CGTGAACAGT 


AAAGGCCGTG 


TGCTGGTTGC 


TGTTTCCGCC 


GGCCCAGACT 


TCTCCCTTCG 


TCACCCGGCG 


GACCTGCCCG 


ACAAGCAG 
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and functional equivalents of said nucleotide sequence including naturally occurring 
derivatives, variants and degeneracy equivalents. 

10. A substantially pure nucleotide sequence for VP4 being: 

GGCGGAGGTA CATCCACTCC AACAACTGGC AACCAAAACA TGTCCGGAAA CAGTGGTTCA 

5 ATTGTTCAAA ATTTTTACAT GCAACAGTAC CAGAATTCAA TTGACGCAGA CCTGGGAGAC 

AATGTGATTA GCCCTGAAGG CCAGGGCAGC AACACTAGTA GTTCAACCTC ATCAAGCCAA 

TCCTCTGGCT TGGGCGGGTG GTTCTCTAGT TTGCTGAACC TTGGAACAAA ACTACTGGCT 

and functional equivalents of said nucleotide sequence including naturally occurring 
0 derivatives, variants and degeneracy equivalents. 



1 1 . Oligonucleotide primers derived from the nucleotide sequence of claim I 
being highly specific for ERhVl or cross-reactive with other ERhV types. 

12. An oligonucleotide primer according to claim 11 having the following 
nucleotide sequence: 

5 VP IF 5' GTTGTGTTCAAGATTGCAGGC 3' 

13. An oligonucleotide primer according to claim 1 1 having the following 
nucleotide sequence: 

VP1R1 5' TTGCTCTCAACATCTCCAGC 3' 

14. An oligonucleotide primer according to claim 11 having the following 

D nucleotide sequence: 

VP1R2 5' TAGCACCCTCCTTTATCATGCG 3' 

15. Oligonucleotide probes derived from the nucleotide sequence of claim 1. 

16. Diagnostic reagents, methods and kits characterised by the oligonucleotide 
primers and probes of claims 11 to 15. 

17. Antigens comprising any one or a combination of the non-capsid proteins, 
being other than the individual VP1 to VP4 proteins, that are cleavage products of 
the polypeptide of claim 2. 
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18. Vaccines characterised by the incorporation of any one of a combination of 
virion proteins VP 1 to VP4. 

19. Vectors characterised by the incorporation of any one or a combination of 
virion proteins VP1 to VP4. 

20. A diagnostic test for the detection of antibodies to ERhVl in blood of horses 
and any other animal species characterised by the use of the antigens of claim 17. 

21. A diagnostic test according to claim 20 being an enzyme linked 
immunosorbent assay. 

22. A test to distinguish horses infected with ERhVl in which said virus had 
replicated from horses which have been vaccinated with the vaccine of claim 1 8 
comprising the steps of applying an antigen of claim 17 to a horse and testing for 
an immunoreaction thereto, wherein a positive immunoreaction would indicate that 
said horse had been infected with ERhVl and a negative immunoreaction would 
indicate that said horse has not been infected with ERhVl. 

23. Recombinant piasmids comprising nucleotide sequences and subsequences 
derived from the nucleotide sequence of claim 1. 

24. A recombinant plasmid according to claim 22 comprising the P1-2A-3C 
region of the ERhVl genome. 

25. A host system characterised comprising the nucleotide sequence of claim 1 
or part thereof. 

26. A process for producing a protein product derived from ERhVl comprising 
the steps of selecting out a gene of interest from the ERhVl nucleotide sequence 
of claim 1 and expressing said protein product in a suitable host system. 
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FIGURE 2b 



_ 790 -770 -750 

TAAGTAAAACGCTGTAACTGCATGATTTGCGCCTGTAGCGCCAGTAAAACGCAGAAACCA 
-730 -710 "690 

CAAGCAAAAACCTGTAGCGTCAGTAAAACGCGCACATTCACATACAGAGCTTCCCGGCTT 
-670 -650 -630 

TAAGGGTTACTGCTCGTAATGAGAGCACATGACAACTTGTCGAGATTACGGCAACTGTCA 
-610 -590 "570 

CGGGAGAGAGGAGCCCGTTTTCGGGCACTTGTCTCCTAAACAATGTTGGCGCGCATTTGC 
-550 -530 "510 

GCGiKi^i^SSffr'rTTTCAGCCCCCTGTCATTGACTGGTCGAAGCGTTCGCAATAAGACT 
-490* -470 -450 

GGTCGTCACTTGGCTGTTCTATCGTTTCAGGCTTTAGCGCGCCCTTGCGCGGCGGGCCGT 
-430 -410 "390 

CAAGCCCGTGCGCTGTATAGCGCCAGGTAACCGGACAGCGGCGTGCTGGATTTTCCCGGT 
-370 -350 "330 

GCCATTGCTCTGGATGGTGTCACCAAGCTGACAAATGCGGAGTGAACCTCACAAAGCGAC 
-310 -290 -270 

ACGCCTGTGGTAGCGCTGCCCAAAAGGGAGCGGAACTCCCCGCCGAGGCGGTCCTCTCTG 
-250 -230 "210 

GCCAAAAGCCCAGCGTTGATAGCGCCTTTTGGGATGCAGGAACCCCACCTGCCAGGTGTG 
-190 -170 -ISO 

AAGTGGAGTGAGCGGATCTCCAATTTGGTCTGTTCTGAACTACACCATTTACTGCTGTGA 
-130 -HO -90 

AGAATGCCCTGGAGGCAAGCTGGTTACAGCCCTGACCAGGCCCTGCCCGTGACTCTCGAC 

-70 -50 -30 

CGGCGCAGGGTCAAAAATTGTCTAAGCAGCAGCAGGAACGCGGGAGCGTTTCTTTTCCTT 

-10 10 30 

TTGTAOTGA CATGATGG CGGCGTCTAAQGTGTATAOAGTTTGCGAGCAGACTCTGCTGGC 

M A ASKVYRVCEQTLLA 

50 70 
AOGTCCCCTTCO CATGATGC ACAAA 

GAVRMMDK 
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